AITopics | Naha

--In this work, we investigate causal learning of independent causal mechanisms from a Bayesian perspective. Confirming previous claims from the literature, we show in a didactically accessible manner that unlabeled data (i.e., cause realizations) do not improve the estimation of the parameters defining the mechanism. Furthermore, we observe the importance of choosing an appropriate prior for the cause and mechanism parameters, respectively. Specifically, we show that a factorized prior results in a factorized posterior, which resonates with Janz-ing and Sch olkopf's definition of independent causal mechanisms via the Kolmogorov complexity of the involved distributions and with the concept of parameter independence of Heckerman et al. Impact Statement --Learning the effect from a given cause is an important problem in many engineering disciplines, specifically in the field of surrogate modeling, which aims to reduce the computational cost of numerical simulations. Causal learning, however, cannot make use of unlabeled data - i.e., cause realizations - if the mechanism that produces the effect is independent from the cause. In this work, we recover this well-known fact from a Bayesian perspective.

artificial intelligence, machine learning, realization, (16 more...)

arXiv.org Machine Learning

doi: 10.1109/TAI.2024.3522867

2504.01424

Country:

Europe > Austria > Styria > Graz (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Dominican Republic (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.70)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

I-trustworthy Models. A framework for trustworthiness evaluation of probabilistic classifiers

Vashistha, Ritwik, Farahi, Arya

arXiv.org Machine LearningJan-26-2025

As probabilistic models continue to permeate various facets of our society and contribute to scientific advancements, it becomes a necessity to go beyond traditional metrics such as predictive accuracy and error rates and assess their trustworthiness. Grounded in the competence-based theory of trust, this work formalizes I-trustworthy framework -- a novel framework for assessing the trustworthiness of probabilistic classifiers for inference tasks by linking local calibration to trustworthiness. To assess I-trustworthiness, we use the local calibration error (LCE) and develop a method of hypothesis-testing. This method utilizes a kernel-based test statistic, Kernel Local Calibration Error (KLCE), to test local calibration of a probabilistic classifier. This study provides theoretical guarantees by offering convergence bounds for an unbiased estimator of KLCE. Additionally, we present a diagnostic tool designed to identify and measure biases in cases of miscalibration. The effectiveness of the proposed test statistic is demonstrated through its application to both simulated and real-world datasets. Finally, LCE of related recalibration methods is studied, and we provide evidence of insufficiency of existing methods to achieve I-trustworthiness.

calibration, data mining, machine learning, (17 more...)

arXiv.org Machine Learning

2501.15617

Country:

North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Michigan > Genesee County > Flint (0.04)
North America > United States > Florida > Broward County (0.04)
(2 more...)

Genre:

Research Report > Experimental Study (0.67)
Research Report > New Finding (0.46)

Industry:

Law (0.94)
Health & Medicine > Therapeutic Area (0.46)
Government > Regional Government (0.46)

Technology:

Information Technology > Data Science > Data Mining (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.54)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Add feedback

Adaptive Refinement Protocols for Distributed Distribution Estimation under $\ell^p$-Losses

Yuan, Deheng, Guo, Tao, Huang, Zhongyi

arXiv.org Artificial IntelligenceNov-8-2024

Consider the communication-constrained estimation of discrete distributions under $\ell^p$ losses, where each distributed terminal holds multiple independent samples and uses limited number of bits to describe the samples. We obtain the minimax optimal rates of the problem in most parameter regimes. An elbow effect of the optimal rates at $p=2$ is clearly identified. To show the optimal rates, we first design estimation protocols to achieve them. The key ingredient of these protocols is to introduce adaptive refinement mechanisms, which first generate rough estimate by partial information and then establish refined estimate in subsequent steps guided by the rough estimate. The protocols leverage successive refinement, sample compression, thresholding and random hashing methods to achieve the optimal rates in different parameter regimes. The optimality of the protocols is shown by deriving compatible minimax lower bounds.

artificial intelligence, machine learning, protocol, (16 more...)

arXiv.org Artificial Intelligence

2410.06884

Country:

Oceania > Palau (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(8 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Normalising Flow-based Differentiable Particle Filters

Chen, Xiongjie, Li, Yunpeng

arXiv.org Artificial IntelligenceMar-3-2024

Recently, there has been a surge of interest in incorporating neural networks into particle filters, e.g. differentiable particle filters, to perform joint sequential state estimation and model learning for non-linear non-Gaussian state-space models in complex environments. Existing differentiable particle filters are mostly constructed with vanilla neural networks that do not allow density estimation. As a result, they are either restricted to a bootstrap particle filtering framework or employ predefined distribution families (e.g. Gaussian distributions), limiting their performance in more complex real-world scenarios. In this paper we present a differentiable particle filtering framework that uses (conditional) normalising flows to build its dynamic model, proposal distribution, and measurement model. This not only enables valid probability densities but also allows the proposed method to adaptively learn these modules in a flexible way, without being restricted to predefined distribution families. We derive the theoretical properties of the proposed filters and evaluate the proposed normalising flow-based differentiable particle filters' performance through a series of numerical experiments.

differentiable particle filter, particle filter, proc, (14 more...)

arXiv.org Artificial Intelligence

2403.01499

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > New York (0.04)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
(19 more...)

Genre: Research Report (1.00)

Add feedback

Score-based Causal Representation Learning: Linear and General Transformations

Varıcı, Burak, Acartürk, Emre, Shanmugam, Karthikeyan, Kumar, Abhishek, Tajer, Ali

arXiv.org Artificial IntelligenceFeb-1-2024

This paper addresses intervention-based causal representation learning (CRL) under a general nonparametric latent causal model and an unknown transformation that maps the latent variables to the observed variables. Linear and general transformations are investigated. The paper addresses both the \emph{identifiability} and \emph{achievability} aspects. Identifiability refers to determining algorithm-agnostic conditions that ensure recovering the true latent causal variables and the latent causal graph underlying them. Achievability refers to the algorithmic aspects and addresses designing algorithms that achieve identifiability guarantees. By drawing novel connections between \emph{score functions} (i.e., the gradients of the logarithm of density functions) and CRL, this paper designs a \emph{score-based class of algorithms} that ensures both identifiability and achievability. First, the paper focuses on \emph{linear} transformations and shows that one stochastic hard intervention per node suffices to guarantee identifiability. It also provides partial identifiability guarantees for soft interventions, including identifiability up to ancestors for general causal models and perfect latent graph recovery for sufficiently non-linear causal models. Secondly, it focuses on \emph{general} transformations and shows that two stochastic hard interventions per node suffice for identifiability. Notably, one does \emph{not} need to know which pair of interventional environments have the same node intervened.

hard intervention, intervention, node, (17 more...)

arXiv.org Artificial Intelligence

2402.00849

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.13)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > New York > Rensselaer County > Troy (0.04)
(11 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

Stochastic Bayesian Optimization with Unknown Continuous Context Distribution via Kernel Density Estimation

Huang, Xiaobin, Song, Lei, Xue, Ke, Qian, Chao

arXiv.org Artificial IntelligenceDec-20-2023

Bayesian optimization (BO) is a sample-efficient method and has been widely used for optimizing expensive black-box functions. Recently, there has been a considerable interest in BO literature in optimizing functions that are affected by context variable in the environment, which is uncontrollable by decision makers. In this paper, we focus on the optimization of functions' expectations over continuous context variable, subject to an unknown distribution. To address this problem, we propose two algorithms that employ kernel density estimation to learn the probability density function (PDF) of continuous context variable online. The first algorithm is simpler, which directly optimizes the expectation under the estimated PDF. Considering that the estimated PDF may have high estimation error when the true distribution is complicated, we further propose the second algorithm that optimizes the distributionally robust objective. Theoretical results demonstrate that both algorithms have sub-linear Bayesian cumulative regret on the expectation objective. Furthermore, we conduct numerical experiments to empirically demonstrate the effectiveness of our algorithms.

algorithm, context variable, optimization, (14 more...)

arXiv.org Artificial Intelligence

2312.10423

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
North America > United States > Maryland > Baltimore (0.04)
North America > Canada > Quebec > Montreal (0.04)
(11 more...)

Genre: Research Report > New Finding (0.48)

Industry: Health & Medicine (0.99)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.84)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

An overview of differentiable particle filters for data-adaptive sequential Bayesian inference

Chen, Xiongjie, Li, Yunpeng

arXiv.org Artificial IntelligenceDec-14-2023

By approximating posterior distributions with weighted samples, particle filters (PFs) provide an efficient mechanism for solving non-linear sequential state estimation problems. While the effectiveness of particle filters has been recognised in various applications, their performance relies on the knowledge of dynamic models and measurement models, as well as the construction of effective proposal distributions. An emerging trend involves constructing components of particle filters using neural networks and optimising them by gradient descent, and such data-adaptive particle filtering approaches are often called differentiable particle filters. Due to the expressiveness of neural networks, differentiable particle filters are a promising computational tool for performing inference on sequential data in complex, high-dimensional tasks, such as vision-based robot localisation. In this paper, we review recent advances in differentiable particle filters and their applications. We place special emphasis on different design choices for key components of differentiable particle filters, including dynamic models, measurement models, proposal distributions, optimisation objectives, and differentiable resampling techniques.

differentiable particle filter, particle filter, proposal distribution, (14 more...)

arXiv.org Artificial Intelligence

2302.09639

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
Asia > Middle East > Jordan (0.04)
(24 more...)

Genre: Overview (1.00)

Add feedback

Score-based Causal Representation Learning with Interventions

Varici, Burak, Acarturk, Emre, Shanmugam, Karthikeyan, Kumar, Abhishek, Tajer, Ali

arXiv.org Artificial IntelligenceMay-1-2023

This paper studies the causal representation learning problem when the latent causal variables are observed indirectly through an unknown linear transformation. The objectives are: (i) recovering the unknown linear transformation (up to scaling) and (ii) determining the directed acyclic graph (DAG) underlying the latent variables. Sufficient conditions for DAG recovery are established, and it is shown that a large class of non-linear models in the latent space (e.g., causal mechanisms parameterized by two-layer neural networks) satisfy these conditions. These sufficient conditions ensure that the effect of an intervention can be detected correctly from changes in the score. Capitalizing on this property, recovering a valid transformation is facilitated by the following key property: any valid transformation renders latent variables' score function to necessarily have the minimal variations across different interventional environments. This property is leveraged for perfect recovery of the latent DAG structure using only \emph{soft} interventions. For the special case of stochastic \emph{hard} interventions, with an additional hypothesis testing step, one can also uniquely recover the linear transformation up to scaling and a valid causal ordering.

artificial intelligence, intervention, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2301.0823

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Asia > Japan > Honshū > Tōhoku > Iwate Prefecture > Morioka (0.04)
(11 more...)

Genre: Research Report (1.00)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)

Add feedback

Booking.com's Cautious Approach to AI

#artificialintelligenceMar-6-2023, 11:45:17 GMT

Here's what you need to know about the business of travel today. Hotels are increasingly investing in wellness as more travelers are placing an emphasis on their physical and mental well-being coming out of the pandemic. So how can hotels best take advantage of the booming consumer interest in wellness? Senior Hospitality Editor Sean O'Neill turns to one expert for answers in this week's Early-Check In column. Gregory Miller, an analyst at investment firm Truist Securities, said hotels have to tread carefully in wellness. He acknowledged that even luxury hotels have a hard time running spas profitably, noting that poorly executing a product could damage a company's reputation.

booking, cautious approach, wellness, (9 more...)

#artificialintelligence

Country:

North America > United States > District of Columbia > Washington (0.06)
Asia > Japan > Kyūshū & Okinawa > Okinawa Prefecture > Naha (0.06)
Asia > India > Maharashtra > Mumbai (0.06)

Industry:

Consumer Products & Services > Hotels (0.74)
Consumer Products & Services > Travel (0.56)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.60)

Add feedback

Reliable Robustness Evaluation via Automatically Constructed Attack Ensembles

Liu, Shengcai, Peng, Fu, Tang, Ke

arXiv.org Artificial IntelligenceNov-23-2022

Attack Ensemble (AE), which combines multiple attacks together, provides a reliable way to evaluate adversarial robustness. In practice, AEs are often constructed and tuned by human experts, which however tends to be sub-optimal and time-consuming. In this work, we present AutoAE, a conceptually simple approach for automatically constructing AEs. In brief, AutoAE repeatedly adds the attack and its iteration steps to the ensemble that maximizes ensemble improvement per additional iteration consumed. We show theoretically that AutoAE yields AEs provably within a constant factor of the optimal for a given defense. We then use AutoAE to construct two AEs for $l_{\infty}$ and $l_2$ attacks, and apply them without any tuning or adaptation to 45 top adversarial defenses on the RobustBench leaderboard. In all except one cases we achieve equal or better (often the latter) robustness evaluation than existing AEs, and notably, in 29 cases we achieve better robustness evaluation than the best known one. Such performance of AutoAE shows itself as a reliable evaluation protocol for adversarial robustness, which further indicates the huge potential of automatic AE construction. Code is available at \url{https://github.com/LeegerPENG/AutoAE}.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2211.12713

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(10 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Natural Language (0.68)

Add feedback

Filters

Collaborating Authors

Naha

On the Role of Priors in Bayesian Causal Learning

I-trustworthy Models. A framework for trustworthiness evaluation of probabilistic classifiers

Adaptive Refinement Protocols for Distributed Distribution Estimation under $\ell^p$-Losses

Normalising Flow-based Differentiable Particle Filters

Score-based Causal Representation Learning: Linear and General Transformations

Stochastic Bayesian Optimization with Unknown Continuous Context Distribution via Kernel Density Estimation

An overview of differentiable particle filters for data-adaptive sequential Bayesian inference

Score-based Causal Representation Learning with Interventions

Booking.com's Cautious Approach to AI

Reliable Robustness Evaluation via Automatically Constructed Attack Ensembles